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ABSTRACT 

A number of methods of flare prediction rely on classification of physical 
characteristics of an active region, in particular optical classification of sunspots, 
and historical rates of flaring for a given classification. However these methods 
largely ignore the number of flares the active region has already produced, in 
particular the number of small events. The past history of occurrence of flares 
(of all sizes) is an important indicator to future flare production. We present a 
Bayesian approach to flare prediction, which uses the flaring record of an active 
region together with phenomenological rules of flare statistics to refine an initial 
prediction for the occurrence of a big flare during a subsequent period of time. 
The initial prediction is assumed to come from one of the extant methods of flare 
prediction. The theory of the method is outlined, and simulations are presented 
to show how the refinement step of the method works in practice. 

Subject headings: Sun: activity — Sun: flares — Sun: X-rays — methods: 
statistical 



1. Introduction 

Solar flares influence local 'space weather,' and as a result there is a demand for accurate 
flare prediction. Unfortunately no reliable deterministic method of predicting a flare is 
known, and existing methods are probabilistic in nature. 

A number of methods discussed in the literature are based on a commonly used white- 
light classification of sunspots, and the correlation between classification and flare occurrence. 
The Mcintosh classification (Mcintosh 1990) categorizes a group of sunspots into one of 60 
classes, based on three parameters. Historical flare rates for each of the classifications were 
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used by Mcintosh (1990) as the basis of an 'expert system' for flare prediction. The sys- 
tem, called Theophrastus (the associated code is called THEO), also incorporates additional 
information including dynamical properties of spot growth, rotation and shear, magnetic 
topology inferred from sunspot structure, magnetic classification, and previous flare activity. 
The method is apparently somewhat subjective, involving rules of thumb incorporated by 
a human expert. A second approach using the Mcintosh classification was presented by 
Bornmann and Shaw (1994). In this case multiple linear regression was used to determine 
the effective contribution of each of the Mcintosh parameters to the rate of flaring, based 
on historical records of flaring. Codes based on the methods of Mcintosh (1990) and Born- 
mann and Shaw (1994) are used by the Ionospheric Prediction Service (IPS) of Australia to 
issue flare predictions at their Lcarmonth and Culgoora observatories.^ Recently Gallagher, 
Moon and Wang (2002) implemented a system using historical averages of flare numbers for 
Mcintosh classifications to predict a rate for an active region, and then converted this to a 
probability of fiaring in a day using the assumption of Poisson statistics. This prediction 
is given as part of the Big Bear Solar Observatory Active Region Monitor (ARM).^ Finally 
the US National Oceanic and Atmospheric Administration (NOAA) issues flare probability 
forecasts for active regions which include input from THEO.^ 

A shortcoming of methods relying on correlations of flaring with active region classiflca- 
tion based on historical records is that they ignore the important information of how many 
flares the active region of interest has already produced. The system of Mcintosh (1990) 
incorporates information about previous activity, but it is unclear how objectively this is 
done, and the information is limited to the number of large flares already produced by the 
given active region. In the flare prediction literature, the tendency of a region which has 
produced large flares in the past to produce large flares in the future is called persistence, 
which is recognised as one of the most reliable predictors for large flare occurrence in 24-hour 
forecasts (e.g. Ncidig, Wciborg, & Scagravcs 1989). In this paper we argue that the history 
of occurrence of all flares (large and small) observed in a given active region is an important 
indicator as to how the region will flare in the future, and should be used in any prediction. 
A related criticism of methods based on classiflcation and historical records is that a given 
classiflcation may embrace active regions with a variety of flaring rates. If an active region 
has a flaring rate differing from the average historical rate for its class then the predictions 
will be in error. 



^See http://www.ips.gov.au. 

^See http: / /beauty.nascom.nasa.gov/ arm/latest/. 

^See http://www.sec.noaa.gov/ftpdir/latest/daypre.txt. 
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Studies of solar flare statistics provide simple phenomenological rules describing flare 
occurrence. It is well known that flares follow a power-law size distribution, where by size we 
mean e.g. peak flux in soft X-ray. More formally the flare frequency-size distribution A^(<S') 
(i.e. the number of events per unit size S and per unit time) may be written 

N{S) = AS-^ (1) 

where A and 7 are constants. The exact power-law index 7 depends on the choice of the 
quantity S, but typically it is found to be in the range 1.5 to 2 (e.g. Crosby, Aschwan- 
den, & Dennis 1992). The power law index 7 appears to be the same in different active 
regions (Wheatland 2000), although there is some evidence that it varies with the solar cy- 
cle (Bai 1993). A second simple rule concerns the way flares occur in time. Studies of the 
rate of occurrence of soft X-ray flares in individual active regions suggest that events occur 
as a Poisson process in time (e.g. Moon et al. 2001), although many active regions exhibit 
changes in the mean rate of events (Wheatland 2001). 

In this paper we show how the observed record of flaring in an active region may be used 
together with the phenomenological rules of flare statistics to objectively reflne an initial flare 
prediction. The initial prediction may be based on the Mcintosh classification, or may come 
from any other prediction method which does not consider the flare data. The new method is 
envisaged to work as follows. When an active region appears at the east limb of the Sun, the 
best guess as to its future flare productivity comes from one of the conventional prediction 
methods. However, as the active region produces flares, the observed flare statistics are 
used to adjust the prediction for future flaring. After many flares have been observed, the 
prediction for future flaring may be dominated by the contribution from the observed data. 
This process — reflning a probability estimate based on new data — is naturally performed 
using Bayes's theorem (e.g. Sivia 1996; Jaynes 2003). 

The layout of the paper is as follows. In § 2 a simple approach to flare prediction using 
only the past record of flaring from an active region [previously presented in Wheatland 
(2001)] is reiterated. In §3 the new method of prediction, combining existing methods 
and information from observed flare statistics, is described. In § 4 simulations are presented 
showing how the method uses the observed flaring record, and in § 5 the results are discussed. 

2. Wheatland (2001) 

Wheatland (2001) presented a method for flare prediction using only observed flare 
statistics and the assumptions that flares obey Poisson statistics in time, and power-law 
statistics in size, elaborating on a suggestion by Moon et al. (2001). The approach is briefly 
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reiterated here, since it is part of the new method. 

First assume that there is a threshold size Si above which all events occurring in an 
active region are observed, so that the distribution (1) applies for events above that size. 
The total rate of events larger than Si is then 

POO 

Xi= N{S)dS^A{^-l)-'Si'+\ (2) 

JSi 

assuming 7 > 1. Hence the frequency-size distribution may be rewritten 

N{S) = Xi{^-l)Sr'S-\ (3) 

Suppose the probability of a big event in a given period AT is required, where by big we 
mean an event at least as large as S2. According to the distribution (3) the rate of events 
larger than S2 is 



Applying the Poisson model of flare occurrence, the probability of at least one big event 
during a period AT is given by Poisson statistics as 

e = 1 -exp(-A2AT). (5) 

Equations (4) and (5) provide the required estimate. The quantities Si, S2 and AT are 
chosen, and then the parameters Ai and 7 (if the precise value of 7 is assumed unknown) 
need to be estimated from the past history of flaring of the active region. Wheatland (2001) 
assumed that 7 is the same for all active regions, and hence known (see Wheatland 2000), 
and estimated Ai using the Bayesian procedure of Scargle (1998). 

The rationale behind the method of Wheatland (2001) is that the flare frequency-size 
distribution is steep so there are very many small events, which allows Ai to be estimated 
relatively accurately from the observed history of flaring in an active region. Hence the 
estimate of e should be relatively accurate. To make this point quantitative, note that 
from Equations (4) and (5) the uncertainty in the estimate of the probability e is given 
approximately by 

a, ^ \iAT{Si/S2r-' ai , . 

e exp[AiAT(5i/52)^-i]-lAi' ^ 
where ai is the uncertainty in Ai, and where we have ignored any uncertainty in 7. Assuming 
5*2 ^ 5*1 leads to a^/e ~ o"i/Ai. If the rate Ai is determined from M observed events, then 
for Poisson statistics we expect cxi/Ai = M^^/^, and hence 

^ ^ M-V2. (7) 
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Equation (7) provides a crude estimate of the accuracy of the method. To achieve a 10% 
accuracy in the estimate requires of order 100 observed events. 

3. New method 
3.1. Approach 

The Wheatland (2001) method shows how to use the flaring record for an active region 
to make a flare prediction, but it ignores the other information which is normally the basis 
of prediction. It is sensible to combine all of the available information, and in this section 
we consider how to do this. 

We assume that a sequence of events with sizes si, S2, sm (all larger than 5*1) are 
observed to occur at times ti < t2 < ... < tM respectively in an active region. These 
events occur within an observing interval which starts at time tgta and ends at time tend- 
We also have additional information, which we label /, including our knowledge of the 
phenomenological rules of flare statistics, and e.g. the Mcintosh classiflcation of the active 
region. The problem is then to estimate e, the probability of a big event, based on the 
data and the additional information I. By 'estimating e' we strictly mean that we want to 
calculate a probability distribution for the quantity e, based on the available information. 
The peak of this distribution is our most likely value for the probability of occurrence of a 
big flare, and the width of the distribution is a measure of the uncertainty of that value. To 
do this we proceed as follows. First we estimate (calculate probability distributions for) Ai 
and 7 based on the available information, and then we use these distributions to estimate 
A2. Then we use this distribution together with the relationship (5) to estimate the desired 
quantity e. We now consider each of these steps in turn. 

3.2. Estimating 7 

First we consider the calculation of ^7(7), the probability distribution for the power-law 
index 7.^ As mentioned in the Introduction, Wheatland (2000) found that the index 7 is 
independent of active region for a set of hard X-ray events, although the statistics underlying 
the study were somewhat poor. If 7 is the same in all active regions then the observations 



^In the following probability distributions are given labels such as Pj (7) when the actual functional form 
of the distribution is needed. When this is not the case the generic label prob(...) is used to denote a 
distribution. 
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si, S2, sm can be replaced by a larger set of events over many active regions. We return 
to this point in §3.4, but for now admit the possibility that 7 is different in different active 
regions, and consider its estimation based on data for the given active region alone. 

Bai (1993) has shown how to estimate a power-law index for a set of data, using 'max- 
imum hkelihood'. Following Bai, the likehhood function, that is the probabihty of the 
observed data D — {si, S2, sm} given the model, is (assuming 7 > 1) 

M 

prob(i^|7,/) (X - l){si/S^)-\ (8) 

1=1 

where / stands for all additional information, including knowledge of the phenomenological 
rule (1). Wc note that this expression requires 7 > 1, which follows from the requirement 
that the probability distribution for size S is normalized over all S larger than Si. It is 
not necessary to introduce an upper cutoff for S in the present treatment (provided 7 > 1), 
although an upper cutoff is necessary to ensure that the mean flare size is finite, if 7 < 2. 
We will return to this point in § 5. 

Hayes's theorem may be used to convert the likelihood into the probability of the model 
given the data, which is what we are interested in: 

prob(7|£), /) oc prob(£)|7, /) x prob(7, /), (9) 

where prob(7, 1) is the 'prior distribution' for 7, i.e. the distribution we would assign to 7 
in the absence of the data (e.g. Sivia 1996). A choice needs to be made for this distribution, 
and a common choice is to assume a constant value within minimum and maximum values 
7i and 72 respectively: 

p.ob(,i...)^{ 2;^^^^^ (10) 

which is referred to as a 'uniform prior'. Wc note that for a uniform prior the most likely 
value of 7 is the maximum of the likelihood function: 

Ei=iln(si/5i) 

which is the maximum likelihood estimate of 7 found by Bai. 

We can identify prob(7|L>,/) with -Ry(7), and then Equations (8) and (9) give the 
required 'posterior distribution' for 7: 

^7(7) = ^^^^^^r(7), (12) 
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where 

M 

and where we have relabelled the prior distribution r(7). The normalizing factor C is 
determined by the requirement P^{'y)d'y — 1.^ For a uniform prior the integral may be 
performed, leading to 

C ^ (72-7i)^(ln7r)^+VM! 

P[M+l,(72-l)ln7r] -P[M + 1,(71 -1) In tt]' ^ ^ 

where P{a,x) denotes the incomplete Gamma function (Abramowitz and Stegun 1964). 

Before proceeding we present a rough estimate of the uncertainty in the most likely value 
of 7 based on the distribution ^7(7) with a uniform prior. Assuming Gaussian behavior in 
the vicinity of the peak, the width of the distribution (12) is ~ [L"(7*)]~^/^, where 
L(7) = — lnP^(7), and where 7* is the location of the peak of the distribution (Sivia 1996). 
This leads to Is/P-I'^ j Invr, and using Equation (11) gives 

(7^ ^ (7* - l)M-^/2. (15) 



3.3. Estimating Ai 

Next we consider the calculation of Pi(Ai), the distribution of the rate Ai of flares larger 
than 5"!. This is a more difficult problem because the rate of flaring in an active region may 
vary with time (see e.g. Wheatland 2001). However, observations suggest that a piecewise- 
constant Poisson process provides a good model for the way flares occur in time in individual 
active regions. 

We assume that a period of time of duration T' <T immediately prior to tend is identified 
(i.e. from t = tend — T' to t — tend) during which time flare occurrence is consistent with a 
constant-rate Poisson process. 

One approach to identifying the necessary period of time has been presented by Scargle 
(1998), who showed how to select a piecewise-constant Poisson model to describe an observed 
sequence of events. When applied to a sequence of events at times ti < t2 < ... < tM 
the Scargle method gives a sequence of times tBo < ^bi < ■■■ < tsK at which the rate is 



^In the following all normalizing factors arc labelled C, although they refer to different values. It is 
understood that in each case the value C is to be determined by integration. 
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determined to change (where tso — tsta. and tBK — tend are the start and end of the observing 
period), and a corresponding sequence A^i, \b2, ^bk of rates. The sequence of times and 
rates is called a set of 'Bayesian blocks'. In this case we identify T' with tsK — iB{K-i)- We 
note that the original Bayesian blocks procedure [which was used e.g. by Wheatland (2001)] 
does not necessarily select the best piecewise-constant model. Recently Scargle has found 
a computationally feasible way to determine the optimal decomposition (Scargle, private 
communication, 2003). We begin by assuming this method (or another method) has been 
applied to the data, to determine the required period T' prior to the end of observations. 

A probability distribution for the rate Ai is then be determined as follows. We assume 
that M' < M events are observed during the selected period T' . The probability of the 
observed data D' (strictly this comprises not just the number of events but also their times) 
given a Poisson model with rate Ai is 

prob(D'|Ai,/) oc Af e-^i^', (16) 

where we retain only the dependence on Ai on the right hand side of this equation, and 
where we formally recognise any additional information by the dependence on I. Bayes's 
theorem may be used to turn this likelihood into a probability of the model given the data, 
and the additional information: 

prob(Ai|D', /) oc prob(L''|Ai, /) x prob(Ai, /), (17) 

where prob(Ai, /) is the prior distribution for the rate. 

The prior distribution prob(Ai, /) represents the estimate of the rate of flaring for the 
active region in the absence of any data. This distribution allows the incorporation of any 
additional information we have about the expected rate of flaring, not including the actual 
data. To make this concrete, we will consider the case that the additional information is 
the Mcintosh classification of the sunspots associated with the active region, although we 
stress that any other additional information can also be incorporated. When the additional 
information is the Mcintosh classification, a suitable prior distribution can be constructed 
from historical records of the observed rates of events above size 5*1 for every active region 
of the same class. This is a generalization of the analysis underlying present flare predic- 
tion methods based on Mcintosh classification, which considers only the mean flaring rate 
extracted from historical data. Hence we propose the construction of distributions of flaring 
rate for each Mcintosh classification. We assume these are available, and label the appro- 
priate distribution Amc(Ai), where MC denotes Mcintosh classification. Equation (17) then 
becomes 

Pi(Ai) = CAf'e-^^^'AMc(Ai), (18) 
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where we have identified proh{Xi\D' , I) with Pi(Ai), and and where C is the normahzation 
factor. This is the required posterior distribution for Ai. 

It should be noted that the distribution (18) exphcitly uses only a subset of all flares 
observed in an active region, i.e. the M' < M flares observed during the interval T' < T. 
Previous data contribute only to the determination of the interval T'. The motivation is that 
when the rate changes, the old rate is no longer relevant for future prediction. For many 
active regions the observed rate appears to be constant during a transit of the disk, or at 
least no rate change is detectable (e.g. Wheatland 2001), in which case all observed flares 
contribute explicitly to the inference. 

Before proceeding we note two simple results for Equation (18) with a uniform prior. 
First, it is easy to see that with a uniform prior the maximum of this distribution occurs at 
M'/T'. Second we note the well known result that for large AiT' and neglecting the prior. 
Equation (18) approximates a Gaussian with a width 

^1 ^ (19) 
which is consistent with the arguments at the end of § 2. 



3.4. Estimating e 

The probability distribution P2(A2) for the rate A2 of flares larger than 5*2 may be 
constructed from the distributions -Pi(Ai) and ^7(7) using Equation (4). Specifically we 
have A2 = Ai(5'i/5'2)'''~^, and hence 

/OO /"OO 
d7 dAiPi(Ai)P^(7)5[A2-Ai(V'52)^-'], (20) 

and performing the integral over Ai leads to 

The quantity we are interested in is e, the probability of an event bigger than S2 occur- 
ring in an interval AT. The probability distribution Pe(e) for this quantity may be contructed 
from the distribution for A2 by a change of variable. Specifically, from Equation (5) we have 
A2 = - ln(l - e)/AT, and hence 



7-1 



(21) 



Pe{e) = P2[A2(6)] 



de 
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ln(l - e) 
AT 



1 

Ar(l-e)' 



(22) 



Using Equations (12), (18), and (21) in (22) leads to 

/oo 



(23) 



where 



/(6,7) = C[-ln(l-e)]^'(7-l)^^r(7) 

'I _ g){T'/AT)(52/Si)^-i-l^ 



(^2/5'i)^'^-^'+l 



TT 



X 



^MC 



Infl 



AT \Si 



7-1 



(24) 



is the joint probabihty distribution for e and 7. The normahzation factor C is obtained 
by requiring that P£(e)(ie = 1. We note that ^7(7) and -Pe(e) may be considered to be 
marginal distributions of /(e,7) (i.e. they are obtained by integration over e and 7 respec- 
tively). However, Equation (12) gives the distribution for 7 directly. 

As noted in §3.2, observations suggest that 7 is the same in all active regions, in which 
case the index can be determined very accurately from events over many active regions using 
Equation (11). If the estimate is 7*, then we can consider the prior distribution for 7 to be 
r(7) = 5(7 — 7*), and Equation (23) simplifies to 



P,(e)=C[-ln(l-e)]^'(l-e) 



v(r7AT)(52/5i) 



MC 



ln(l I 02 



AT 



(25) 



Equations (23), (24) and (25) are the required expressions for the posterior probability 
distribution for e. 



4. Simulations 

We present two simulations demonstrating the application of the method to synthetic 
data. These simulations omit the inclusion of other information via the prior A]v[c(Ai), so 
they illustrate only how the method performs using the observed data. 

First we consider the case that 7 is assumed to be known. Ten days of flaring were 
simulated by producing a sequence of event times as a Poisson process in time with a rate 
Ai = 0.5 per day for the first five days, and with a rate Ai = 5.0 per day for the second five 
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days. Each event was assigned a size according to a power law distribution with an index 
7 = 1.8, above the threshold size 5*1 = 1 (in arbitrary units). Figure 1 illustrates a typical 
simulation. The first (upper) panel shows the size of each event versus the time at which 
the event occurred. In this case there were 31 events. The simulation applies the method 
to the problem of predicting the probability of a big event occurring during the next day 
(AT = 1 day) at the end of the ten days. The size of a big event was taken to be S2 = 100. 
The original Bayesian blocks procedure (Scargle 1998) was applied to the event time series 
to determine a decomposition into a sequence of piecewise-constant intervals and rates. The 
second panel of Figure 1 shows the result of this process: the sohd hues indicate the rate as a 
function of time inferred by the Bayesian blocks procedure, and the dotted lines indicate the 
true rate versus time. The Bayesian blocks procedure correctly identifies a two-rate model as 
the most likely model, and identifies the approximate time of the change in rate. The third 
panel shows the probability distribution Pe(e) obtained from Equation (25) with a uniform 
prior for Ai, and with M' and T' equal to the number of events in the second Bayesian block 
and the duration of the second Bayesian block respectively. The dotted vertical line in this 
panel is the true value of e. We see that, even for a relatively small number of events, the 
method is able to provide a good estimate of the probabihty of a big event. The width of 
the inferred distribution for e is consistent with Equation (7). 

Second we consider the more difficult case of simultaneously estimating 7 and Ai. Ten 
days of flaring were again simulated, with a rate Ai = 1 per day for the first five days, 
and a rate Ai = 10 per day for the second five days. Larger rates were chosen to provide 
more events for the inference, but the other parameters were kept the same as in the first 
simulation. Figure 2 illustrates the results of a typical simulation. The first (upper) panel 
shows the time history of events — in this case 57 events occurred. The second panel shows 
the result of a Bayesian blocks decomposition of the data (solid lines) together with the true 
rate versus time (dotted lines). Once again the Bayesian blocks procedure correctly identifies 
a two-rate model as the most likely model, and identifies the approximate time of the change 
in rate. The third panel shows the result of using Equation (12) — with a uniform prior with 
71 = 1.25 and 72 = 2.25 — to construct the distribution for 7. The dotted vertical line in 
this panel shows the true value of 7. The fourth panel of Figure 2 shows the distribution for 
e constructed using Equation (23), with M = 57, with M' and T' obtained from the second 
Bayesian block, and with uniform prior distributions for 7 and Ai. The dotted vertical line 
indicates the true value. From this simulation we see that a reasonable estimate for e is 
obtained for a relatively small number of events. 

The distribution for e obtained in the lower panel of Figure 2 is quite broad. A ba- 
sic reason is that e depends sensitively on 7 because of its appearance as an exponent in 
Equation (4), and 7 has a range of possible values, as shown in the third panel of Figure 2. 
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Fig. 1. — Simulation of 10 days of flaring and application of the prediction method, assuming 
7 is known. 
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Fig. 2. — Simulation of 10 days of flaring and application of the prediction method, assuming 
7 is unknown. 
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This effect may be seen by considering /(e, 7) [defined by Equation (24)], which is the joint 
distribution of e and 7. Figure 3 shows a contour plot of /(e, 7) for the simulation depicted in 
Figure 2. The dotted vertical and horizontal lines are the true values of e and 7 respectively. 
The dashed curve is defined by e = 1 — cxp[— (M'/T')(S'i/S'2)'''^^AT], and the contours of 
/(e, 7) are observed to be stretched out along this curve. The practical implication of this 
figure is that accurate estimation of e depends on accurate estimation of 7. In practice 7 is 
known a priori quite accurately, but in this simulation we have assumed that 7 is initially 
unknown (within the range 1.25 to 2.25), to illustrate the process of inference. 



5. Discussion 

Existing methods of solar fiarc prediction do not make complete use of an important 
source of information: the time history of flares already observed in the active region of 
interest, in particular frequently occurring small events. A new method for flare prediction 
is presented which exploits the observed history of flaring from an active region to improve 
an initial prediction, which e.g. may come from one of the existing methods. To make the 
example concrete we may think of the initial prediction coming from from the Mcintosh 
sunspot classification, which is a common basis for prediction. This background information 
provides an initial estimate for the expected flaring rate through a prior distribution Amc('^i)! 
which represents the probability that the flaring rate above a (small) size Si is Ai, given 
historical rates of occurrence of flares for the given Mcintosh class. Bayes's theorem is then 
used to estimate the probability e of observing a large flare (above size 5*2) in a given period 
of time, based on this prior information and on the sequence of flares already produced by 
the active region, and assuming simple phenomenological rules describing the occurrence 
of flares. In this paper the basic theory behind the inference of e based on observed data 
is presented. The inclusion of background information [i.e. the construction of the priors 
Amc(Ai)] is yet to be done. 

The method rehes on event sizes following the phenomenological law (1). Some studies of 
very small extreme ultraviolet events ('nanoflares') suggest that their thermal energies follow 
a steeper distribution than energies of large events (e.g. Krucker and Benz 1998; Parnell and 
Jupp 2000), although this remains controversial (e.g. Aschwanden and Parnell 2002). From 
the point of view of the prediction method presented here, the uncertainty over the low-size 
end of the distribution is irrelevant provided events signiflcantly larger than nanoflares are 
used. In any case the observed distributions from many active regions may be examined as a 
check on Equation (1). A related point is that the distribution (1) requires a cutoff at large 
sizes on energetics grounds, and neglect of this cutoff will lead to the number of large flares 
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being overestimated. A cutoff will be incorporated before the method is applied to real data. 

The choice of the quantity S has not been addressed, although a good choice is likely 
to be important to the method. Most flare forecasting deals with soft X-ray events, in 
particular prediction of GOES (Geostationary Observational Environmental Satellite) M and 
X class events (events with peak fluxes greater than 10~^W/m^ and 10~^W/m^ respectively 
in the 1-8 Angstrom band observed by the satellites). A practical motivation for this is that 
flare soft X-ray emission causes disturbances of the ionosphere which affect shortwave radio 
communication, and there is a need to predict these occurrences. A disadvantage of using 
GOES events is that they are not ideal for flare statistics e.g. because of problems with event 
selection due to the large background in soft X-ray (see Wheatland 2001). 

A number of other issues also need to be considered before the method is implemented 
with real data. A point neglected so far is that active regions evolve, so that predictions 
based on the traditional methods also change with time. For example, an active region 
evolves through Mcintosh classifications (e.g. Bornmann, Kalmbach, Kulhanek, and Casale 
1990). Changes in background information such as this should be incorporated through 
changes in the prior, and this question will be considered in more detail in future work. A 
related point concerns the construction of the prior distributions for rate. It is likely that 
the Mcintosh classification will be used, although other possibilities will be considered. The 
problem is then to determine the probability of a given Mcintosh class having a given rate, 
based on observed fiaring sequences in the historical record for active regions of that class. 
The details of this calculation will be addressed in future work. 

Finally, as with all methods of forecasting, it is essential to test the reliability of the 
method. It is straightforward to compare, after the fact, the number of predicted and the 
number of observed events for a large sample of active regions. The method presented here 
will be implemented and tested in this way, and the results compared with existing methods 
of prediction. 
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Fig. 3. — Contour map of the joint probability of e and 7, for the simulation in Fig. 2. 



